Pattern-Based Acquisition of Scientific Entities from Scholarly Article Titles

نویسندگان

چکیده

We describe a rule-based approach for the automatic acquisition of salient scientific entities from Computational Linguistics (CL) scholarly article titles. Two observations motivated approach: (i) noting aspects an article's contribution in its title; and (ii) pattern regularities capturing terms that could be expressed set rules. Only those lexico-syntactic patterns were selected easily recognizable, occurred frequently, positionally indicated entity type. The rules developed on collection 50,237 CL titles covering all articles ACL Anthology. In total, 19,799 research problems, 18,111 solutions, 20,033 resources, 1,059 languages, 6,878 tools, 21,687 methods extracted at average precision 75%.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Amusing titles in scientific journals and article citation

The present study examines whether the use of humor in scientific article titles is associated with the number of citations an article receives. Four judges rated the degree of amusement and pleasantness of titles of articles published over 10 years (from 1985 to 1994) in two of the most prestigious journals in Psychology, Psychological Bulletin and Psychological Review. We then examined the as...

متن کامل

Unsupervised concept based entity extraction from scientific titles

Œis paper studies the extraction and typing of entities from titles of academic literature, in order to gain a deeper understanding of their speci€c contributions and automate the construction of a problem-solution knowledgebase. To achieve this goal, we propose an unsupervised, domain independent, two phase algorithm to extract entity mentions and type them into appropriate concepts. In the €r...

متن کامل

Syntactic Structures in Research Article Titles from Three Different Disciplines: Applied Linguistics, Civil Engineering, and Dentistry

Deducing what a paper is about, titles are considered as the most important determinant of how many people will read the article. Therefore, studying the use of different syntactic structures and their rhetorical functions in titles is of great significance. The current study was set to investigate these structures used in research article titles in three disciplines of Applied Linguistics, Den...

متن کامل

task-based instruction, consciousness-raising and iranian efl learners acquisition and use of collocation

the present study sought to examine whether a task-based approach could have an impact on raising awareness of collocations. moreover, it sought to investigate the facilitative role of consciousness-raising tasks of collocations in the communicative instances of use. to this end, 68 intermediate learners of english were selected via a placement test. the participants were taught with classroom ...

15 صفحه اول

Automatic acquisition of Named Entities for Rule-Based Machine Translation∗

This paper proposes to enrich RBMT dictionaries with Named Entities (NEs) automatically acquired from Wikipedia. The method is applied to the Apertium English–Spanish system and its performance compared to that of Apertium with and without handtagged NEs. The system with automatic NEs outperforms the one without NEs, while results vary when compared to a system with handtagged NEs (results are ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2021

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-030-91669-5_31